Text to Speech Corpus
Text to speech corpus.
Assamese Text to Speech Corpus
Assamese Text to Speech Corpus 44:49:34 hours | 28.85 GB | 32,594 Audio Segments | 2 Speakers The LDC-IL Assamese Text to Speech dataset comprises audio files in wav format, accompanied by a corresponding textual layer in Assamese script. This dataset spans a duration of 44:49:34 (hh:mm:ss) , consisting of read speech in the studio setup. The data is derived from 01 female and 01 male native Assamese speakers. A comprehensive explanation of dataset can be found in the Assamese Text to Speech Documentation. For any research-based citations, please use the following citations: Syeda Mustafiza Tamim, Prangshu Manjul, Stephen Fernandes, Nithin S., Roopashri M. R., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan. 2025. Assamese Text to Speech Corpus. Central Institute of Indian Languages, Mysore. 978-93-48633-45-3. Rejitha K. S. and Narayan Kumar Choudhary. (ed.). 2025. LDC-IL Corpus Insights. Central Institute of Indian Languages, Mysore. 978-93-48633-33-0...
Maithili Text to Speech Corpus
32:42:20 hours | 19.56 GB | 32260 Audio Segments | 2 SpeakersThe LDC-IL Maithili Text to Speech dataset comprises audio files in wav format, accompanied by a corresponding textual layer in Devanagari script. This dataset spans a duration of 32:42:20 (hh:mm:ss) , consisting of read speech in the studio setup. The data is derived from 01 female and 01 male native Maithili speakers. A comprehensive explanation of dataset can be found in the Maithili Text to Speech Documentation.For any research-based citations, please use the following citations:Shantanu Kumar, Dinesh Mishra, Saurabh Varik, Stephen Fernandes, Nithin S., Roopashri M. R., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan. 2025. Maithili Text to Speech Corpus. Central Institute of Indian Languages, Mysore. 978-93-48633-36-1.Rejitha K. S. and Narayan Kumar Choudhary. (ed.). 2025. LDC-IL Corpus Insights. Central Institute of Indian Languages, Mysore. 978-93-48633-33-0...